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O Abstract 

o 

CN We investigate the global scheduling of sporadic, implicit deadline, real-time task systems 

on multiprocessor platforms. We provide a task model which integrates job parallelism. We 
prove that the time- complexity of the feasibility problem of these systems is linear relatively 
to the number of (sporadic) tasks for a fixed number of processors. We propose a scheduling 
algorithm theoretically optimal (i.e., preemptions and migrations neglected). Moreover, we 
provide an exact feasibility utilization bound. Lastly, we propose a technique to limit the number 
of migrations and preemptions. 
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O 1 Introduction 

CO 

The use of computers to control safety-critical real-time functions has increased rapidly over the 
past few years. As a consequence, real-time systems — computer systems where the correctness 
of each computation depends on both the logical results of the computation and the time at which 
^ these results are produced — have become the focus of much study. Since the concept of "time" is 

of such importance in real-time application systems, and since these systems typically involve the 
sharing of one or more resources among various contending processes, the concept of scheduling 
is integral to real-time system design and analysis. Scheduling theory as it pertains to a finite set 
of requests for resources is a well-researched topic. However, requests in real-time environments 
are often of a recurring nature. Such systems are typically modeled as finite collections of simple, 
highly repetitive tasks, each of which generates jobs in a very predictable manner. These tasks have 
bounds upon their worst-case execution requirements and their periods, and associated deadlines. 

In this work, we consider sporadic task systems, i.e., where there is at least T, (called the period) 
time units between two consecutive job instances of the same task. A job which occurs at time t 
must be executed for at most Q time units in the time interval [t, t + D,-) (where Dj is the relative 
deadline). A particular case of sporadic tasks are the periodic ones for which the period is the 
exact temporal separation between the arrival of two successive jobs generated by the task. We 

"This is an extended version of the paper Integrating Job Parallelism in Real-Time Scheduling Theory, published in the 
journal Information Processing Letters. Ben Rodriguez 1 " pointed that the definition of work-limited parallelism did not 
express constraints that we were using implicitly in the remainder of the paper. Consequently we adapted the definition 
and clarified the proof. 
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shall distinguish between implicit deadline systems where D, = T„ Vz; constrained deadline systems 
where D, < Tj, Vz; and arbitrary deadline systems where there is no constraint between the deadline 
and the period. Moreover, we assume that the various tasks are independent (i.e., except the m 
processors there are no other shared resources, no critical sections nor precedence constraints). 

The scheduling algorithm determines which job[s] should be executed at each time instant. 
We distinguish between off-line and on-line schedulers. On-line schedulers construct the schedule 
during the execution of the system; while off-line schedulers mimic during the execution of the 
system a precomputed (off-line) schedule. Remark that if a task is not active at a given time instant 
and the off-line schedule planned to execute that task on a processor, the latter is simply idled (or 
used for a non-critical task). 

When there is at least one schedule satisfying all constraints of the system, the system is said to 
be feasible. More formal definitions of these notions are given in Section [2j 

Uniprocessor sporadic (and periodic) real-time systems are well studied since the seminal paper 
of Liu and Layland [1T21 which introduces a model of implicit deadline systems. For uniprocessor 
systems we know that the worst case arrival pattern for sporadic tasks corresponds to the one of 
(synchronous and) periodic tasks (see, e.g. Ifl4ll ). Consequently, most of the results obtained for 
periodic tasks remain for sporadic ones. Unfortunately, this is not the case upon multiprocessors 
due to scheduling anomalies (see, e.g. |fl]|). 

The literature considering scheduling algorithms and feasibility tests for uniprocessor schedul- 
ing is tremendous. In contrast for multiprocessor parallel machines the problem of meeting timing 
constraints is a relatively new research area. 

1.1 Model 

We deal with jobs which may be executed on different processors at the very same instant, in which 
case we say that job parallelism is allowed. For a task t; and m identical processors we define a m- 

def 

tuple of real numbers T, = {yi,\, ... , yi, m ) with the interpretation that a job of t, that executes for t 
time units on / processors completes yu x t units of execution. Full parallelism, which corresponds 
to the case where r ; = (1,2, ... ,m) is not realistic; moreover, if full parallelism is allowed the 
multiprocessor scheduling problem is equivalent to the uniprocessor one (by considering, e.g., a 
unique processor m times faster). 

In this work, we consider work-limited job parallelism with the following definition: 

Definition 1 (work-limited parallelism). The job parallelism is said to be work-limited if and only if 
for all Yi we have: 

VI < i < n, VI < < / < m, 

]' Yi.f , 
— > — - and 

1 Yi,j 
Yi,(i'+i) ~ Yi.f ^ Y CO'+i) _ Yij 
Note that the last restriction is equivalent to 

Ym'+c) ~ Yif Ytij+d) ~ Yi.j 
c ~ d 

For instance, the m-tuple Tj = (1.0,1.1,1.2,1.3,4.9) is not a work-limited job parallelism, since 
yj ; 5 = 4.9 > 1.3 x | = 1.625. These restrictions may at first seem strong, but are in fact intuitive: 
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we require that parallelism cannot be achieved for free, and that even if adding one processor 
decreases the time to finish a parallel job, a parallel job on f processors will never run j'/j times 
as fast as on j processors. Moreover, if going from j to j + 1 processors implies some performance 
loss, then going from + 1 to j + 2 processors must impact the performance by at least the same 
amounlQ 

Many applications fit in this model, as the increase of parallelism often requires more time to 
synchronize and to exchange data between parallel processes. Remark that work-limited paral- 
lelism requires that for each task (say t,), the quantities yu are distinct (y,- 1 < y, 2 < y/,3 < • • •)• 

1.2 Related research 

Even if the multiprocessor scheduling of sporadic task systems is a new research field, important 
results have already been obtained. See, e.g., |]2j [4j [3j [T5J for details. 

All these works consider models of tasks where jobs use at most a single processor each time 
instant. This restriction is natural for the uniprocessor scheduling since only one processor is 
available at any time instant even if we deal with parallel algorithms. Nowadays, the use of parallel 
computing is growing (see, e.g., IfTTll ): moreover, parallel programs can be easily designed using 
the Message Passing Interface (MP I [8j [9j) or the Parallel Virtual Machine (PVM flTll|6|]) paradigms. 
Even better, sequential programs can be parallelized using tools like OpenMP (see [j5]| for details). 
Therefore for the multiprocessor case we should be able to describe jobs that may be executed on 
different processors at the same time instant. For instance, we find such requirements in real-time 
applications such as robot arm dynamics [17], where the computation of dynamics and the solution 
of a linear systems are both parallelizable and contain real-time constraints. 

Few models and results in the literature concern real-time systems taking into account job 
parallelism. Manimaran et al. in [13]] consider the non-preemptive EDF scheduling of periodic tasks, 
moreover they consider moldable tasks (the actual number of used processors is determined before 
starting the system and remains unchanged), while we consider malleable tasks (the number of 
assigned processors to a task may change during the execution). Meanwhile, their task model and 
parallelism restriction (i.e., the sub-linear speedup) is quite similar to our model and our parallelism 
restriction (work-limited). Han et al. in IfTOll considered the scheduling of a (finite) set of real-time 
jobs allowing job parallelism. Their scheduling problem is quite different than our, moreover they 
do not provide a real model to take into account the parallelism. 

1.3 This research 

In this paper, we deal with global scheduling of implicit deadline sporadic task systems with work- 
limited job parallelism upon identical parallel machines, i.e., where all the processors are identical 
in the sense that they have the same computing power. We consider the feasibility problem of 
these systems, taking into account work-limited job parallelism. For work-limited job parallelism 
we prove that the time-complexity of the feasibility problem is linear relative to the number of tasks 
for a fixed number of processors. We provide a scheduling algorithm. 

To the best of our knowledge there is no such result in the literature and this manuscript pro- 
vides a model, a first feasibility test and a first exact utilization bound for such kind of systems. 

1 Ben Rodriguez pointed out that we implicitly use this last restriction in the remainder of the paper, and that it is thus 
required in the definition of work-limited parallelism. 
2 Job migration and preemption are allowed. 
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1.4 Organization 



This paper is organized as follows. In Section [2j we introduce our model of computation. In 
Section |3j we present the main result for the feasibility problem of implicit deadline sporadic task 
systems with work-limited job parallelism upon identical parallel machines when global scheduling 
is used. We prove that the time-complexity of the feasibility problem is linear relative to the number 
of tasks when the number of processors is fixed. We provide a linear scheduling algorithm which 
is proved theoretically optimal and we give an exact feasibility utilization bound. In Section |4j we 
propose a technique to limit the number of migrations and preemptions. We conclude and we give 
some hints for future work in Section [5j 



2 Definitions and assumptions 

We consider the scheduling of sporadic task systems on m identical processors {p\,p2,- ■ ■ ,Pm\- A 
task system t is composed of n sporadic tasks x\, % n , each task is characterized by a period 
(and implicit deadline) Tj, a worst-case execution time Q and a m-tuple Tj - (yi,i, ji,2, ■■■ , Yi,m) to 
describe the job parallelism. 

def 

We assume that y/ o = (Vz) in the following. A job of a task can be scheduled at the very same 
instant on different processors. In order to define the degree of parallelization of each task t ; - we 
define the execution ratios yu,Vj e [1,2, ... ,m) associated to each task-index of processor pair. A 
job that executes for t time units on j processors completes yu x t units of execution. In this paper 
we consider work-limited job parallelism as given by Definition [T] 

def 

We will use the notation t, = (Q, T;, r,), Vz with T, = (y^i, y^, . • • ,Yi,m) with y^ < yi^ < ■ ■ ■ < 
yi /tn . Such a sporadic task generates an infinite sequence of jobs with T; being a lower bound on 
the separation between two consecutive arrivals, having a worst-case execution requirement of Q 

units, and an implicit relative hard deadline T,. We denote the utilization of t; by zz, ^ f S. In our 
model, the period and the worst-case execution time are integers. 

A task system t is said to be feasible upon a multiprocessor platform if under all possible sce- 
narios of arrivals there exists at least one schedule in which all tasks meet their deadlines. 



2. 1 Minimal required number of processors 

Notice that a task t, requires more than k processors simultaneously if Uj > y^; we denote by /q 
the largest such k (meaning that kj is the smallest number of processors] such that the task t ; is 
schedulable on fc ; + 1 processors): 

k^[° ifUi ~ ni (1) 

lmax?.{fc | yijc < Ui} otherwise. 

For example, let us consider the task system x - {t\, T2} to be scheduled on three processors. 
We have x x = (6,4Ji) with Ti = (1.0,1.5,2.0) and t 2 = (3,4,r 2 ) with T 2 = (1.0,1.2,1.3). Notice that 
the system is infeasible if job parallelism is not allowed since z\ will never meet its deadline unless 
it is scheduled on at least two processors (i.e., ki = 1). There is a feasible schedule if the task t\ is 
scheduled on two processors and t 2 on a third one (i.e., ki = 0). 
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2.2 Canonical schedule 

Definition 2 (schedule a). For any task system z - {ti,...,t„} and any set of m processors 
{pi,...,p m } we define the schedule o(t) of system z at instant t as a : R+ — > {0,1, . . . ,n} m where 

def 

a(f) = {ai{t),a2{t),...,a m {t)) with 



if there is no task scheduled on pj at instant t 

1 if Tj is scheduled on pj at instant t 



for all 1 < j < m. 



We will now define canonical schedules. In what follows we will prove that it is always possible 
to find such a canonical schedule for all feasible task systems (see Theorem [5]) . Refer to Figure [2] 
for an example of a simple canonical schedule. Intuitively a schedule is canonical if it is a schedule 
where the tasks with higher indices are assigned to the highest available processors greadily. 

Definition 3 (canonical schedule). For any task system z = {z\, . . . , z„] and any set of m processors 
{pi, . . . ,p m }, a schedule a is canonical if and only if the following inequalities are satisfied: 

VI < < m, VO < t < t' < 1 : oft') < Oj(t) 

VI < ;' < / < m, Mt, t' e [0, 1) : Oj(t) < o f {t') 

and the schedule a contains a pattern that is repeated every unit of time, i.e., Vt e R +/ VI < ; < m : 
am = odt + 1). 



Without loss of generality for the feasibility problem, we consider a feasibility interval of length 
1. Notice that the following results can be generalized to consider any interval of length I, as long 
as I divides entirely the period of every task. 



3 Our feasibility problem 

In this section we prove that if a task system t is feasible, then there exists a canonical schedule in 
which all tasks meet their deadlines. We give an algorithm which, given any task system, constructs 
the canonical schedule or answers that no schedule exists. The algorithm runs in 0(n) time with n 
the number of tasks in the system. 

We start with a generic necessary condition for schedulability using work-limited parallelism: 

Theorem 4. In the work-limited parallelism model and using an off-line scheduling algorithm, a 
necessary condition for a sporadic task system z to be feasible on m processors is given by: 

LL u i-7i,k, \ 
[ki + \<m 
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Proof. As t is feasible on m processors, there exists a schedule a meeting every deadline. We 

def 

consider any time interval [t, t + P) with P = lcmfTi, T2, . . . , T n }. 

Let an denote the duration where jobs of a task t, are assigned to processors on the interval 
[t, t + P) using the schedule o. The sum YJf=\ j • a i,j gives the total processor use of the task t, on the 
interval (total number of time units for which a processor has been assigned to t,). As we can use 
at most m processors concurrently we know that 

n m 

otherwise the jobs are assigned to more than m processors on the interval. If on some interval 
of length I, Tj is assigned to processors, we can achieve the same quantity of work on f > j 
processors on an interval of length £^-. In the first case, the processor use of the task i is I j, 

Yi,f 

while in the second case it is If y^. By the first restriction that we enforced on the tuple T, (see 
Definition [l), we have 

nr Yi,? 
> *i 



Let a' be a slightly modified schedule compared to a, where Vz + i'/V/,«J . = fl^y. For the task %{>, it 
is scheduled on f processors instead of ; < / in o for some interval of length I , i.e. 

a 'i',j = a i',i ~ t 

> _ 

a i',r ~ ai 'j + c z~ 

' Yi,)' 

Then, for that task t,v , 

m m 

j=l j=l 

This proves that increasing the parallelism yields an increased sum. It remains to prove that it 
is not better to increase the parallelism to f > j processors on some interval t in order to decrease 
it to /" < j processors on some other interval £' (see Figure [TJ. 

The quantity of work originally achieved on the interval £ + €' is (€ + €')■ yu while the processor 
use is (£ + £') ■ j. After the change, the quantity of work is iyu> + t'yu" for a processor use of 
ij'+i'j". 

Suppose the processor use is not changed, and we shall show that the quantity of work has 
decreased. We start by noting that as the processor use is not changed, (£ + £') ■ j = if + £' j" , and 
we get 

t(f-i)=m-n (2) 
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Figure 1: Increasing the parallelism on some interval I to be able to decrease it on t' . 
From the last restriction enforced by work-limited parallelism (see Definition [TJ, we know that 

Yi,r - m £ (n,j - Yij") ■ — p (3) 

Now we have all the tools needed: 

iy iif + t'yi,f = € \y ir ji - y^) + iy^ + €'yq - £' (y^ - y^) 

= £ (nr - - ? (n/ - Yi,r) + ^ + Oyy 

= * {Yij' - Yij) - £ {Yij - Yij") ^—y, + W + ^Yij (b y Equation^ 

^ £ {nj - Yij" ) JZTf-, ~ ^ {nj - Yij" ) JZTf + (£ + £' )YH Cby Equation [3]) 

< (t + nyi,j 



In other words we decreased the quantity of work for a fixed amount of processor use; if we want 
to keep the same quantity of work, we need to increase the processor use and thus the sum defined 
above. 

So, we proved that we should minimize the parallelism; as we want to derive a necessary 
condition, we schedule the task on the minimal number of processors required. A lower bound on 
the sum is then given by 

m - y^ 

YiJci+l ~ Yih 

which corresponds to scheduling the task on kj + 1 processors for a minimal amount of time, and 
on ki processors for the rest of the interval. Then 

m 

] ■ a i4 >k r P + ■ P 

j=1 ttfc+i _ Yih 



kj-P+ — : — ■ P 
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and thus 



n m 

YjTjj'^i - m ' p 

i=l ;=1 

f(krP+ Ui ~ m -p) < m-P 
\ Yijq+i ~ Yih I 

LL u i-7i,ki \ 
\ki + < m 
i=l \ 7ijc i+ i-nhl 

which is the claim of our theorem. □ 

Theorem 5. Given any feasible task system t, the canonical schedule meet all the deadlines. 

Proof. The proof consists of three parts: we first give an algorithm which constructs a schedule a for 
t, then we prove that a is canonical, and we finish by showing that the tasks meet their deadlines 
if t is feasible. 

The algorithm works as follows (see Algorithm [l} : we consider sequentially every task t; 3 with 
i = n,n - 1, . . .,1 and define the schedule for these tasks in the time interval [0, 1), which is then 
repeated. 

We calculate the duration (time interval) for which a task t, uses k\ + 1 processors. If we denote 
by £i the duration that the task t, spends on kj + 1 processors, then we obtain the following equation: 

ti 7i,ki+i + (! ~ 4) YVa = u i 
Therefore we assign a task t, to k\ + 1 processors for a duration of 

Uj - Yik 
Yik+i - Yik 

and to kj processors for the remainder of the interval, which ensures that the task satisfies its 
deadline, since each job generated by the sporadic task t, which arrives at time t receives in the 
time interval [t, t + T,) exactly T, x u\ - Cj time units. 

The task t„ is assigned to the processors (p m , ■ ■ ■ ,Pm-k„) ( see Figure [2]). If u n t Yn,k„+i, another 
task can be scheduled at the end of the interval on the processor p m -k„> as in does not require k„ + 1 
processors on the whole interval. 

We continue to assign greedily every task Zj, by first considering the processors with highest 
number. The schedule produced by the above algorithm is canonical as it respects the three con- 
straints of the definition: 

• on every processor we assign tasks by decreasing index, thus o\(f) is monotone and decreas- 
ing; 

• for all i < i' , if %{> is scheduled on a processor py, then %\ is assigned to a processor pj with 

i < f; 

• the schedule is repeated every unit of time. 
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Figure 2: Schedule obtained after scheduling the task %■ 



The last step is to prove that if our algorithm fails to construct a schedule, i.e., if at some point 
we run out of processors while there are still tasks to assign, then the system is infeasible. 

Let A; be the the total processor use of the task t, in every unit length time interval. In the case 
of a canonical schedule, A; corresponds to: 

"j - Yijq 
A; = kj + . 

Yi,ki+\ - Yijq 

So for instance if, in the canonical schedule, a task t; is assigned to A, = 2.75 processors, it means 
that it is scheduled on two processors for 0.25 time unit in any time interval of length 1, and on 
three processors for 0.75 time unit in the same interval. 

If our algorithm fails, it means that £" =1 A, > m, which by Theorem [4] implies that the system is 
infeasible. □ 

A detailed description of the scheduling algorithm is given by Algorithm [TJ If we consider the 
task system t = {ti,T2} given before we have k\ - 1 and k2 = 0. By using Algorithm [T] we obtain: 

o 3 (t) = 2, We [0,0.75) 
cr 3 (f) = l,Vie [0.75,1) 
o 2 (t) = l,Vte[0,l) 
oi{t) = l,Vf e [0,0.75) 
cri(f) = 0,Vf e [0.75,1). 

Notice that Algorithm [T] does not provide satisfactory schedules for problems for which the 
number of migrations and preemptions is an issue. We shall address this question in the next 
section. 



Corollary 6. In the work-limited parallelism model and using an off-line scheduling algorithm, a 
necessary and sufficient condition for a sporadic task system t to be feasible on m processors is given 
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Algorithm 1 Scheduling algorithm of implicit deadline sporadic task system x of n tasks on m 

processors with work-limited job parallelism 

Require: The task system x and the number of processors m 

Ensure: A canonical schedule of x or a certificate that the system is infeasible 

l: let j - m 

2: let t = 

3: let o p (t) = 0, Vf £ [0, 1), VI < p < m 
4: for i = n downto 1 do 
5: if Ui < yi t \ then 
6: let k = 
7: else 

8: let k = max™ =1 {k \ y ik < u { } 
9: end if 

10: for r = 1 upto kj do 
li: let Oj(t) = i,Vte [t , 1) 
12: Iet(T ; _i(0 = i/Vte[0 / to) 
13: let ;' = ;'- 1 
14: end for 

15: let imp = to + ^ 

16: if tap > 1 then 

17: let Oj(t) = i, Vf e [f , 1) 

18: let ;' = ;'- 1 

19: let f = 

20: let fmp = fmp - 1 

21: end if 

22: let Oj(t) - i, Vf 6 [fo, fmp) 

23: let f = tmp 

24: if j < then 

25: return Infeasible 

26: end if 

27: end for 
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by: 

£f\ yija+i-yitt) 

Please notice that Corollary [6] can be seen as feasibility utilization bound and in particular a 
generalization of the bound for uniprocessor (see [12]) where a sporadic and implicit deadline task 
system is feasible if and only if £" =1 u, < 1. Like the EDF optimality for sporadic implicit deadline 
tasks is based on the fact that Y!i=\ u i < 1 is a sufficient condition, we prove the optimality of the 

canonical schedule based on the fact that F,f-i ifa + - - } ' ,k J 1 < W2 is a sufficient condition. 



Corollary 7. There exists an algorithm which, given any task system, constructs the canonical schedule 
or answers that no schedule exists in 0(n) time. 

Proof. We know that the algorithm exists as it was used in the proof of Theorem [5} For every task, 
we have to compute the number of processors required (in 0(1) time, as the number of processors 
m is fixed), and for every corresponding processor define Oj(t) appropriately. In total, 0(n) time 
is required. □ 



4 Scheduling problem reduction 




Figure 3: Improved scheduling of t m _i and t„ 

Regarding optimality, we proved that each task (say t,) must use permanently kj processors] 
simultaneously, and that optionally t, has to use an additional processor partially (i.e., with a dura- 
tion strictly less than the unity in each interval of length 1). Regarding the number of migrations, 
we can however define a better schedule without loss of optimality. For instance, in the schedule 
given by Figure [2] task t„_i migrates between k n -\ + 1 processors each time unit. I.e., x n -\ uses k n -\ 
processors in [0,€ n ), k n -\ + 1 processors in [£„,£„ + i n -\\ and again k n -\ processors in [£ n + £ n -\,\), 
but not the very same processors as the ones used in [0,^]. Consequently, there is necessarily a job 
migration of t„_i each time unit. We can however assign k n -\ processors statically and permanently 
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to x n _i and an additional processor sporadically (for a duration of £ n -\ ■ T n -\, see Figure [3]). In 
terms of optimality we use exactly the same number of processors but the number of migrations 
is null (at least for x n _i in this example). Since we assign statically and permanently tasks to pro- 
cessors we can actually reduce our scheduling problem: the scheduling problem of the n (original) 
sporadic tasks upon m processors with work-limited parallelism can be reduced to a more studied 
and simpler scheduling problem: the scheduling of n' (n' < ri) sporadic tasks upon m' [m' < m) 
processors where job parallelism can be forbidden (without loss of optimality). 

We shall now formalize the scheduling problem reduction. Let {t\,T2, ■ ■ ■ ,T n } be our (original) 
sporadic task set to schedule on m processors. By the definition of the quantity kj (Eq. [T} and the 
optimality of Algorithm [2] (Theorem [5]), the scheduling problem can be reduced to the scheduling 

of the sporadic task set %' - \%'%' %t . . ■,?'„} with C' = ^ •T,-,T' = Tj upon m' - m - £" =1 k{ where 
job parallelism can be forbidden (without loss of optimality). 

While our (canonical) schedules are optimal (the original one and the improved one) for x, 
the number of preemptions can be too large (there is actually at least one preemption each time 
unit for the set of tasks i')- For the systems where this is an issue, since job parallelism can be 
forbidden for z', with potential loss of schedulability we can schedule %' using, for instance, global 
EDF (see [7]), global Rate Monotonic (see O) or using a partition scheme (see [1] for instance) 
in order to reduce nicely the number of preemptions. We shall analyze more precisely the case of 
using EDF to schedule the task sub-set %' in the next section. 

4.1 Scheduling %' using EDF 

In order to reduce strongly the number of preemptions in the schedule of task sub-set %' we can 
use EDF (or one of its variants) instead of the canonical schedule. We know that the number of 
preemptions using EDF is bounded from above by the number of jobs in the set (and consequently, 
the total number of context switches is bounded by twice the number of jobs). It can similarly be 
shown that the total number of interprocessor migrations of individual jobs is bounded from above 
by the number of jobs. We know that the task sub-set t' is schedulable using the optimal canonical 
schedule since the following necessary and sufficient condition is satisfied: 

E u 'i - m ' 

Unfortunately EDF is not optimal in terms of the number of processors required and the con- 
dition above is only necessary. Using, e.g., EDF-US[l/2] (which gives top priority to jobs of tasks 
with utilizations above 1/2 and schedules the remaining jobs using EDF) Baker |[3l proved correct 
the following sufficient (not exact) feasibility test: the task sub-set %' is schedulable by EDF-US [1/2] 
upon m processors if 

(2-£ M ;-l)<m 

t'gt' 

The trade-off is actually the additional number of processors required: \l ■ Lt'gt' u \ ~ l] _ m ' ■ 
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5 Discussions 



5.1 Job parallelism vs. task parallelism 

In this manuscript we study multiprocessor systems where job parallelism is allowed. We would 
like to distinguish between two kinds of parallelism, but first the definitions: task parallelism allows 
each task to be executed on several processors at the same time, while job parallelism allows each 
job to be executed on several processors at the same time. If we consider constrained (or implicit) 
deadline systems task parallelism is not possible. For arbitrary deadline systems, where several 
jobs of the same task can be active at the same time, the distinction makes sense. Task parallelism 
allows the various active jobs of the same task to be executed on a different (but unique) processor 
while job parallelism allows each jobs to be executed on several processors at the same time. 

5.2 Optimality and future work 

In this paper we study the feasibility problem of implicit deadline sporadic task systems with work- 
limited job parallelism upon identical parallel machines when global scheduling is used. We prove 
that the time-complexity of our problem is linear relative to the number of tasks. We provide an 
optimal scheduling algorithm that runs in 0(n) time and we give an exact feasibility utilization 
bound. 

Our algorithm is optimal in terms of the number of processors used. It is left open whether 
an optimal algorithm in terms of the number of preemptions can be designed. As a first step, we 
used an interval of length 1 to study the feasibility problem. Without loss of optimality, we can 
improve our algorithm to work on an interval of length equal to the gcd of the periods of every 
task, which decreases the number of preemptions and migrations. We do not know, however, if this 
improvement minimizes the number of preemptions and migrations. 

The definition of work-limited job parallelism was given here for identical processors. One 
should investigate an extension of this definition to heterogeneous platforms. 
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